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Abstract 

We study a number of two-user interference networks with multiple-antenna transmitters/receivers 
(MIMO), transmitter side information in the form of linear combinations (over an appropriate finite-field) 
of the information messages, and two-hop relaying. We start with a Cognitive Interference Channel (CIC) 
where one of the transmitters (non-cognitive) has knowledge of a rank-1 linear combination of the two 
information messages, while the other transmitter (cognitive) has access to a rank-2 linear combination 
of the same messages. This is referred to as the Network-Coded CIC, since such linear combination 
may be the result of some random linear network coding scheme implemented in the backbone wired 
network. For such channel we develop an achievable region based on a few novel concepts: Precoded 
Compute and Forward (PCoF) with Channel Integer Alignment (CIA), combined with standard Dirty- 
Paper Coding. We also develop a capacity region outer bound and find the sum symmetric Generalized 
Degrees of Freedom (GDoF) of the Network-Coded CIC. Through the GDoF characterization, we show 
that knowing "mixed data" (linear combinations of the information messages) provides an unbounded 
spectral efficiency gain over the classical CIC counterpart, if the ratio (in dB) of signal-to-noise (SNR) to 
interference-to-noise (INR) is larger than certain threshold. Then, we consider a Gaussian relay network 
having the two-user MIMO IC as the main building block. We use PCoF with CIA to convert the 
MIMO IC into a deterministic finite-field IC. Then, we use a linear precoding scheme over the finite- 
field to eliminate interference in the finite-field domain. Using this unified approach, we characterize 
the symmetric sum rate of the two-user MIMO IC with coordination, cognition, and two-hops. We also 
provide finite-SNR results (not just degrees of freedom) which show that the proposed coding schemes 
are competitive against state of the art interference avoidance based on orthogonal access, for standard 
randomly generated Rayleigh fading channels. 

Index Terms 

Interference Channel, Nested Lattice Codes, Compute and Forward, Generalized Degrees of Freedom, 
Network Coding 



I. Introduction 

Interference is one of the fundamental aspects of wireless communication networks. Although the 
full characterization of the Interference Channel (IC) capacity is elusive, much progress has been made 
in recent years. The capacity region of the two-user Gaussian IC was characterized within 1 bit, by 
using superposition coding with an appropriate power allocation of the private and common message 
codewords, and by providing a new upper bounding technique [1|. Degrees of Freedom (DoF) results are 
obtained under the assumption of full channel knowledge for the two-user multiple input multiple output 
(MIMO) IC with arbitrary number of antennas at each node in Q, for the K user IC with time-varying 
or frequency-selective channels in ||3|], for the i^-user IC with constant channel coefficients in |4|, and 
for the K-user IC with multiple antennas fSl. Also, Generalized DoF (GDoF) results are found for the 
two-user MIMO IC in ||6j| and for symmetric K user IC in I?]. Further results for various interference 
networks can be found in ||8|. 

In many practical communication systems, transmitters or receivers are not isolated. For example, 
in cellular systems the base stations are connected through a wired backhaul network through which 
information messages and some form of channel state information or coordination can be shared ||9|- 



|13|. In wired networks, routing is generally optimal only for the single-source single-destination case 
1 14|. In the more general case of multiple sources and multiple destinations (multi- source multicasting), 
linear network coding is known to achieve the min-cut max-flow bound p3| . In practice, random linear 
network coding is of particular interest for its simplicity. In this case, intermediate nodes forward liner 
combinations of the incoming messages by randomly and independently choosing the coefficients from 



an appropriate finite-field |16|. Going back to the cellular systems case, if random linear network coding 
is used in the backhaul network, the base stations obtain linear combination of the messages instead of 
individual messages. If the backhaul link serving a given base station has capacity large enough, the 
rank (per unit time) of such linear combinations is equal to the number of independent messages (per 
unit time), so that the base station knows all the messages. In contrast, if the backhaul link is a capacity 
bottleneck, the rank (per unit time) is less than the number of independent messages (per unit time). 
In this case, the base station has access to "mixed data", i.e., rank-deficient linear combinations of the 
messages. We refer to this model as the Network-Coded Cognitive IC (CIC). 

An example of Network-Coded CIC is shown in Fig. [T] including a cellular BS and a home BS (e.g., a 
femtocell access point). The cellular BS is connected to the data router, which generates both messages, 
via a high capacity link supporting rate 2Rq. The home BS is connected to the same data router via 
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Fig. 1. In the classical CIC, the data router sends the one of information messages to the non-cognitive transmitter (i.e., 
//(wjjWj) = Wj). In the Network-Coded CIC, the data router forwards "mixed data" to the non-cognitive transmitter (i.e., 
i(wi,W2) = w^ ewa). 



lower capacity link supporting only rate Rq. In this case, the data router sends two information messages 
to the cellular BS (equivalently, a rank-2 linear combination thereof) and a rank-1 linear combination 
of the messages to the home BS. In the case of routing, this linear combination has coefficients and 



1, reducing to the classical CIC, which has been extensively investigated in the literature |17|-|19|. In 
particular, the Gaussian CIC capacity region was approximately characterized within 1.87 bits in 1*201. 
If general network coding is used instead of routing, the rank- 1 linear combination has generally non- 
zero coefficients and therefore contains mixed data. Notice that in the model of Fig. [T] mixed data can 
be provided without violating the backhaul capacity constraint of Rq. At this point, a natural question 
arises: Does mixed data at the "non-cognitive" transmitter provide a capacity increase "for free" for 
the Network-Coded CIC over the conventional CIC? 

Proceeding along this line, we observe that a basic level of multicell cooperation named interference 
coordination has been investigated and it is currently considered in industry for its practical aspects. 
A simple two-user model for interference coordination consists of a data router, two M-antenna base 
stations, and two M-antenna user receivers. The wired backhaul links from the router to the base stations 
have the same capacity equal to Rq. The data router has no knowledge on channel state information 
(CSI), due to the separation between physical layer and network layer. However, the base stations have 
full CSI of both the direct and interfering links, obtained from the users through feedback channels. 
Such CSI knowledge allows the base stations to coordinate in their sharing strategies such as power 
allocation and beamforming directions [21]. From an information theoretic viewpoint, this model is a 



two-user MIMO IC. Assuming that network coding is used in the backhaul, the data router can deliver 
linear combinations of the messages instead of individual messages, at the same cost in terms of backhaul 
capacity constraint. Hence, the model becomes a two-user MIMO IC with mixed data at both transmitters, 
and shall be referred to as the Network-Coded Interference Coordination Channel (ICC). In this paper, 
we address the following question: Does mixed data at the transmitters provide a capacity increase "for 
free" for the Network-Coded ICC over conventional interference coordination? 

Finally, building on the insight gained in the above Network-Coded cognitive networks, we study the 
2x2x2 MIMO IC, consisting of two transmitters (sources), two relays, and two receivers (destinations), 
where nodes have M multiple antennas. This model is non-cognitive but has some commonality with 
the previous models in the sense that it consists of two cascaded two-user MIMO ICs where the relay 
can have access to mixed messages if proper alignment and coding over the finite-fields is used in the 
first hop. The 2x2x2 Gaussian IC has received much attention recently, being one of the fundamental 
building blocks to characterize two-flow networks p2|. One natural approach is to consider this model 



as a cascade of two ICs. In | [23| , the authors apply the Han-Kobayashi scheme p4| for the first hop to 
split each message into private and common parts. Relays can cooperate using the shred information (i.e., 
common messages) for the second hop, in order to enhance the data rates. This approach is known to 
be highly suboptimal at high SNR, since two-user IC can only achieve 1 DoF. In [25] it was shown that 
I DoF is achievable by viewing each hop as an X-channel. This is accomplished using the interference 
alignment scheme for each hop. More recently, the optimal DoF was obtained in |26] using a new 
scheme called aligned interference neutralization, which appropriately combines interference alignment 
and interference neutralization. Also, the K x K x K Gaussian IC was recently studied in |27l, where 
it is shown that the K x K MIMO cut-set upper bound (equal to K) can be effectively achieved using 
aligned network diagonalization. 

A. Contributions 

1) Network-Coded CIC: single antenna case: We characterize the capacity region of a finite-field 
Network-Coded CIC using distributed zero-forcing precoding. We notice that this region is equivalent to 
that of a finite-field vector broadcast channel. This shows that in this case partial cooperation yields the 
same performance of full cooperation, as long as the non-cognitive transmitter knows the mixed message 
rather than its own individual message only. Thus, we conclude that mixed data at the non-cognitive 
transmitter can increase capacity. It is worthwhile noticing that the finite-field model is itself meaningful 
in practical wireless communication systems, by the observation that the main bottleneck of a digital 
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Fig. 2. The generalized degrees-of-freedom (GDoF) of the two-user Gaussian Network-Coded CIC. For the interference regimes 
with p > 1/2, the gap between the Network-Coded CIC and CIC becomes arbitrarily large as SNR and INR goes to infinity. 
This shows that mixed-data at the non-cognitive transmitter can provide the unbounded capacity gain at high SNR. 



receiver is the Analog to Digital Conversion (ADC), which is costly, power-hungry, and does not scale 
with Moore's law. Rather, the number of bit per second produced by ADC is roughly a constant that 



depends on the power consumption |28|, |29|. Therefore, it makes sense to consider the ADC as part 



of the channel. This, together with the algebraic structure induced by lattice coding, produces a finite- 



field model as shown by the authors in |13|, |30|. Motivated by this first successful result, we present 
a novel scheme nicknamed Precoded Compute-and-Forward (PCoF) for Gaussian Network-Coded CIC. 
CoF makes use of nested lattice codes, such that each receiver can reliably decode a linear combination 
with integer coefficient of the interfering codewords |33|. Thanks to the fact that lattice are modules over 
the ring of integers, this linear combination translates directly into a linear combination of the information 
messages defined over a suitable finite-field. For brevity, we refer to this fact as "lattice linearity" in the 
following. Finally, the interference in the finite-field domain is completely eliminated by distributed zero 
forcing precoding (over finite-field). This scheme can be thought of as a distributed approach of Reverse 



CoF (RCoF), proposed by the authors in |12|, |13| for the downlink of distributed antenna systems. 



Another novel contribution of this work is the characterization of the sum GDoF (see (H and definition 



in (661) of the Gaussian Network-Coded CIC. This is obtained by combining an improved achievability 
result that makes use of both Dirty-Paper Coding (DPC) and PCoF, with a new outer bound on the 
sum rate. As a consequence of the GDoF analysis, we show that mixed data can provide an unbounded 
capacity gain with respect to conventional cognitive interference channels. As shown in Fig. [2] the sum 
GDoF of the Network-Coded CIC is larger than the sum GDoF of the standard IC when p = wg^R , 
the ratio of the interference power over the direct link power (expressed in dB), is larger than 1/3, and 
it is larger than the sum GDoF of the standard CIC when p > 1/2. In contrast, for p < 1/2 it is better 
not to mix the data on the side information link to the non-cognitive transmitter (i.e., use routing in the 
backhaul link). It is also interesting to notice that for p = 1 the use of mixed data provides the same 
two degrees of freedom of full cooperation (two-users vector broadcast channel), mimicking the result 
of the finite-field case. 

2) Network-Coded MIMO IC: As anticipated before, we have considered three communication models 
having the two-user MIMO IC as a building block: Network-Coded ICC (representative of a cellular 
system downlink with interference coordination), Network-Coded CIC with MIMO generalization, and 



2 X 2 X 2 IC p2| , p6l . In all these models, we assume that all nodes have M transmit/receive antennas. Our 
coding scheme is based on the extension of the PCoF idea to the MIMO case. This scheme consists of two 
phases: 1) Using the CoF framework in order to transform the two-user MIMO IC into a deterministic 
finite-field IC; 2) Using linear precoding on the finite-field domain in order to eliminate interference. 
The main performance bottleneck of CoF consists of the non-integer penalty, which ultimately limits 
the performance of CoF at high SNR p9l. To overcome this bottleneck, we employ Channel Integer 
Alignment (CIA) in order to create an "aligned" channel matrix for which exact integer forcing is possible. 
We derive achievable symmetric sum rate results for all three channel modes and prove that PCoF with 
CIA can achieve sum DoF equal to 2M — 1 in all cases. In particular, for the Network-Coded CIC, we 
prove that the optimal 2M sum DoF is achieved by appropriately combining DPC and PCoF as in the 
scalar case. Beyond the DoF results, we further employ the lattice codes algebraic structure in order 
to obtain good performance at finite SNRs. We use the integer-forcing receiver (IFR) approach of [32] 



and integer-forcing beamforming (IFB), proposed by the authors in |12|, |13|, in order to minimize the 
power penalty at the transmitters. We provide numerical results showing that PCoF with CIA outperforms 
time-sharing even at reasonably moderate SNR, with increasing performance gain as SNR increases. 



B. Organization 

This paper is organized as follows. In Section [llj we summarize some definitions on lattices and lattice 



coding and review CoF. In Section III we characterize the capacity region of finite-field Network-Coded 
CIC and present PCoF, as a natural extension of finite-field scheme, for the Gaussian Network-Coded 
CIC. Further, we derive an achievable rate region of the Gaussian Network-Coded CIC, by appropriately 



combining PCoF and DPC, and characterize the sum GDoF. In Section IV we characterize an achievable 
symmetric sum rate for the two-user MIMO ICs under investigation, and derive the DoF of these channels. 
The sum rates are improved in Section [V] using successive cancellation with respect to CoF. In Section VI 



we optimize the (symmetric) sum rate and provide some numerical results, showing good performance 
at intermediate and "practical" values of SNR. 

II. Preliminaries 

In this section we provide some basic definitions and background results that will be extensively used 
in the sequel. 

A. Nested Lattice Codes 

Let Z[j] be the ring of Gaussian integers and p be a prime. Let © denote the addition over F^ with 

q = p^, and let (7 : Fg — )• C be the natural mapping of Fg onto {a + jb : a,b € Zp} C C. We recall 



the nested lattice code construction given in |33|. Let A = {A = zT : z G ^"[i]} be a lattice in C", 
with full-rank generator matrix T G C"^". Let C = {c = wG : w G F^} denote a linear code over 
¥q with block length n and dimension r, with generator matrix G. The lattice Ai is defined through 



"construction A" (see ||35J and references therein) as 

Ai=p-'g{C)T + A, (1) 

where g{C) is the image of C under the mapping g (applied component-wise). It follows that A C Ai C 
p~^A is a chain of nested lattices, such that |Ai/A| = p'^^ and |p^^A/Ai| = p2(n-r) 

For a lattice A and r G C", we define the lattice quantizer Qa{i.) = argmin;^^^ III — AP» the Voronoi 
region Va = {r G C" : QA(r) = 0}, the modulo reduction [r] mod A = r — (5A(r), and the per- 
component second moment a\ = yj., , J^ Ullpdr. 

Given A and Ai as given above, we define the lattice code £ = Ai n Va with rate R = ^ log \C\ = 
-logq. The set p^^g{C)T is a system of representatives of the cosets of A in Ai. This induces a 
natural labeling / : F^ — )• £ of the codewords of C by the information messages w G F^ defined by 
/(w) = p-^g{wG)T mod A. 



B. Compute-and-Forward and Integer-Forcing 



We recall here the CoF scheme of 1 33 1 applied to a particular case of Gaussian MIMO channel with 
joint processing of the receiver antennas and independent lattice coding at each transmit antenna. Our 
reference model is given by 

Y = HCX + Z (2) 

where H e C^^^^\ C G Z[j]^^^^, X^ G C^^", and where Z contains i.i.d. Gaussian noise samples 
~ CJ\f{0, 1). Here, S denotes the number of independent and independently encoded information streams 
(messages) sent by a virtual "super-user" collecting all channel inputs. For k = I, . . . ,S, each fc-th 
independent message W;, € F^ is encoded input the codeword t^ = /(w^) of the same lattice code C of 
rate R and mapped to the channel input sequence 

Xfc = [tfc + dfc] mod A, (3) 

where the dithering sequences {d^} are mutually independent, uniformly distributed over Va, and known 
to the receiverfM The encoded sequences {x^} are arranged by rows into the transmit signal matrix X. 
We also define the matrix T of dimensions S x n containing {t^} arranged by rows, and the dithering 
matrix D with rows {d;.}. 

Channel matrices in the form HC as in Q will appear several times in this paper as a consequence 
of channel integer alignment, explicitly designed such that [CT] mod A has lattice codewords arranged 
by rowsjj 

The decoder's goal is to recover L < M integer linear combinations of the S lattice codewords, given 
by the rows s^ of the matrix S = [B'~'CT] mod A, for some integer matrix B € Z[j]^^^^. Letting b^ 
denote the ^-th column of B, the receiver computes 



y^ 



a^^Y-b^CD 



mod A 

= [b^CT + a^(HCX + Z)-bVC(T + D)] mod A 
= [s^ + z^ff(HC,b^,a£)] mod A (4) 

'The dithering sequences in this paper have these properties, and this fact will be understood in the sequel even though not 
explicitly stated. 

^The modulo A reduction applied to matrices is intended row by row. 



where a.^ € C and z^jj(HC,b^, ct^) is the C-th effective noise sequence, distributed as: 

qVh - b^^) CD + a^Z . (5) 



non-integer penalty Gaussian noise 

Choosing q^ = b^H^^, the variance of the effective noise is given by 



4,,= ||(H-i)Hb,f. (6) 

This choice is referred to in | [32| as the exact Integer Forcing Receiver (IFR). In this way, the non-integer 
penalty of CoF is completely eliminated. More in general, the decoding performance can be improved 
especially at low SNR by minimizing the effective noise variance with respect to cxi for given b^. This 
yields 

cj2^^ = byC(SNR-il + C^^H^^HC)-^C^b^ (7) 



Since b^ and C are integer- valued, s^ = [b^CT] mod A is a codeword of C. From |33|, we know that 
by applying lattice decoding to y given in (|4j) there exist sequences of lattice codes C of rate R and 
increasing block length n such that s^ can be decoded successfully with arbitrarily high probability as 
n — > oo, provided thaj^ 

R < log- f ^) . (8) 

where the expression in the right-hand side of ([8]l is the computation rate for the modulo-A additive 
noise channel (|4]) with given SNR and effective noise variance. All the L linear combinations can be 
reliably decoded if 

i? < mill J log+ ( ^ I I . (9) 

The requirement that all inputs are encoded with the same lattice code with rate constrained by the 
worst computation rate over the desired linear combinations (columns of B) can be relaxed by allowing 
the inputs to be encoded at different rates, using a family of nested lattice codes, and using successive 



cancellation with respect to CoF, according to the scheme proposed and analyzed in |41 1. The application 
of this idea to the networks treated in this paper is examined in Section [V] 

^We define log^(a::) = inax{log(a::),0}. 
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Using the linearity of lattice encodingn the corresponding L linear combinations over F^ for the 
messages are given by 

U = g~^{[B^] modpZ[j])c/-^([C] modpZ[j])W 

= [B^^],[C],W, (10) 

where we use the notation [B'^Jg = (^^-'^([B'^] mod pZ[j]). 

III. Network-Coded Cognitive Interference Channel 

A two-user Gaussian Network-Coded CIC consists of a Gaussian interference channel where transmitter 
1 (the cognitive transmitter) knows both user 1 and user 2 information messages (or, equivalently, two 
independent linear combinations thereof) and transmitter 2 (the non-cognitive transmitter) only knows 
only one linear combination of the messages. Without loss of generality, we assume that transmitter 1 
knows (W]^, W2), and transmitter 2 has w^©W2, where w^ G F^ denotes the information message desired 
at receiver k, at rate i?^ bit/symbol, for /c = 1, 2. We assume that if Ri 7^ R2 then the lowest rate message 
is zero-padded such that both messages have a common length, given by r = max{ni?i, ni?2}, where 
n denotes the coding block length. A block of n channel uses of the discrete-time complex baseband 
two-user IC is described by 

y^ = /iiixi + /112X2 + zi (11) 

y^ = /121X1 + /122X2 + Z2, (12) 

where z^ G C"^^ contains i.i.d. Gaussian noise samples ~ CA/'(0, 1) and hij G C denotes the channel 
coefficients, assumed to be constant over the whole block of length n and known to all nodes. Also, we 
have a common per-user power constraint, given by ^E[||x^|p] < SNR, for A; = 1,2. Each receiver k 
observes the channel output y and produces an estimate Wj;, of the desired message W;.. A rate pair 
(i2i,i?2) is achievable if there exists a family of codes satisfying the power constraint, such that the 
average decoding error probability satisfies \min-^oo IP(w;, / Wj^,) = 0, for both k = 1,2. 

A. Capacity Region for finite-field Network-Coded CIC 

In order to build an intuition for Gaussian channel, we consider the corresponding finite-field model 
and show that distributed zero-forcing precoding achieves the capacity of finite-field Network-Coded CIC. 

^Lattice encoding linearity refers to the isomorphism between F^ and C induced by the natural labeling / defined before. 
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Fig. 3. Distributed zero-forcing preceding for finite-field Network-Coded CIC. Differently from RLNC, the cognitive transmitter 
carefully chooses the coefficients of linear combination according to the channel coefficients qij 's. 



A block of n channel uses of the discrete-time finite-field IC is described by 

= Q 



^1 

^2 



Xi 



X2 



C2 



(13) 



where Ck ^ ^o contains i.i.d. additive noise samples, and q^j G Fg is the {i,j)-th element of Q, denoting 
the channel coefficients from transmitter j to receiver i, assumed to be constant over the whole block of 
length n and known to all nodes. 

Theorem 1: If det(Q) 7^ and (711, 521 7^ 0, the capacity region of the finite-field Network-Coded CIC 
is the set of all rate pairs (i?i, R2) such that 



Rk < \ogq-H{Ck)fork = l,2. 



(14) 



Proof: We first derive a simple upper bound by assuming the full transmitter cooperation. In this case, 
this model reduces to the finite-field vector broadcast channel. A trivial upper-bound on the broadcast 
capacity region is given by p4}: 



Rk < max I{Xi,X2;Yk) for k = 1,2 



(15) 



Due to the additive noise nature of the channel, we have I{Xi, X2; Yk) = H{Yk) — H{C,k)- Furthermore, 



HiYk) < \ogq and this upper bound is achieved by letting (Xi,X2 



Uniform over F^. This bound 



coincides with ( 14 1. 



Next, we derive an achievable rate using distributed zero-forcing precoding technique. Without loss of 
generality, it is assumed that H{(i) < H{C,2)- We use two nested linear codes C2 ^ Ci where Ck has 
rate i?^ = ^ log (7. Let w^ and Wg be the zero-padded information messages to common length ri. The 
detailed procedures of distributed zero forcing technique is as follows (see Fig. [3]). 
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• Transmitter 1 produces the codewords c^^ = w^G and C2 = W2G, and transmits the precoded 
codeword x^ = mic^ © m2C2, for some coefficients mi, 1112 G F^; 

• Transmitter 2 produces the codeword c^ ©C2 = (w^ © W2)G and transmits the precoded codeword 
X2 = m^ici © C2) with coefficient 7713 G F^. 

• Receiver 1 observes: 



y^ = giixi © ^12x2 © Ci 

= Aiici © A12C2 © Ci 

where An = {qnmi © qum^) and A12 = (911^2 © g'i2"^3)- 
Receiver 2 observes: 

Yn = 921X1 © g'22X2 ® Co 



(16) 
(17) 



A22C9 © A2lC 



21^1 



Co 



(18) 
(19) 



where A22 = (g2i"T'2 © 922"i3) and A21 = (g2i"i-i © q22'm^)- 
The goal is to find a precoding vector m = (ttt-i, 7722, ^n-s)^ to cancel interference at both receivers 
(i.e., such that A12 = A21 = 0), while preserving the desired codewords (i.e., such that Aii,A22 / 0). 
Equivalently, we want to find a non-zero vector m to satisfy the following conditions: 
• Condition 1 {canceling the interferences) 



where 



Cm = (20) 

qii qi2 
921 0^22 

• Condition 2 (preserving the desired signals) 

det(QM) = A11A22 / 

where 

mi mo 
M = . (23) 

7713 ms 
Since Rank(C) < 2, there exist non-zero vectors m* G Null(C) that satisfies Condition 1. Since Q has 
full rank. Condition 2 is equivalent to requiring that M has rank 2, i.e., that 7773(7711 — 7/12) / 0. In 



(21) 



(22) 
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short, we have to find the conditions for which a vector m in the null-space of C satisfies m-^ / and 
nil / "^2- Assuming qn / and g2i / 0, we have that Cm = yields 

Ql2 Q22 

1712 = TTT'S, ITT-l = ITT-S- 

qu 921 

Using this in the expression of M, we find that det(M) / if we choose m^ / and if 

_gi2 922 _ -912921 + 911922 _ det(Q) 
911 921 911921 911921 

By assumption, the above condition is always true, therefore we conclude that a vector m* satisfying 

Conditions 1 and 2 can always be found. In this case, the precoded channel decouples into two parallel 

additive noise channels 

y^ = AiiCi©Ci (24) 

y^ = A22C2©C2, (25) 



for which rates Rk < log 9 — H{C,k) are clearly achievable by linear coding |36|. ■ 

Remark 1: The capacity region of finite-field Network-Coded CIC under the assumptions of Theorem 
[T] is equivalent to the capacity region of the corresponding finite-field vector broadcast channel. In other 
words, partial network-coded cooperation and full cooperation yield the same performance. 

Remark 2: It is interesting to notice that if qu = and det(Q) 7^ 0, then 921,912 7^ 0. This implies 
that m in the null space of C takes on the form (0, m2, 0)^ for some 7712 7^ 0. If 521 = and det(Q) 7^ 0, 
then 911,922 / 0. This implies that m in the null space of C takes on the form (mi, 0,0)^ for some 
mi 7^ 0. In both cases, det(M) = and interference cannot be removed without eliminating the useful 
signal at one of the two receivers. 

The observation in the above remark is strengthened by the following unfeasibility result: 
Lemma 1: Perfect channel decoupling is not possible if the conditions of Theorem [T] do not hold. 
Proof: We will show that if the conditions of Theorem [T] are not satisfied, then it is not possible 
to achieve the sum rate of two perfectly decoupled channels, i.e., the sum rate is lower than 21ogq — 
{H{(i) + H{(2))- We employ the upper bounds derived in Appendix [A| such as 

mm{Ri,R2} < mm{I{Xi;Yi\X2),I{Xi;Y2\X2)} (26) 

max{Ri,R2} < max{I{Xi,X2;Yi),I{Xi,X2;Y2)} (27) 

= log(?-min{ii-(Ci),i^(C2)}. (28) 

Notice that the sum rate is equal to m.m{Ri, R2} + max{i?i, i?2}- When qu = 0, the receiver 
1 observes the Yi = 912^2 © Ci- Then, we have that m.m{Ri, R2} = since I{Xi;Yi\X2) = 
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H{Yi\X2)-H{Yi\Xi,X2) = 0. Using ^ and ^, we have that R1+R2 < logg-mm{F(Ci), -H'(C2)}- 



Similarly when 521 = 0, i.e, Y2 = 922-^2 © C2, the min{i?i, i?2} = is also zero because of 
I{Xi;Y2\X2) = 0. Thus, we have that Ri + R2 < logq - mm{H (d) , H {C2)} ■ In both cases, the 
sum rates are strictly less than 21ogg — (-ff(Ci) + H{(2))- ■ 

B. Scaled Precoded CoF 

Motivated by the above result, we present a novel scheme named Precoded Compute-and-Forward 
(PCoF) for the Gaussian Network-Coded CIC. Using CoF decoding, each receiver can reliably decode an 
integer linear combination of the lattice codewords sent by transmitters. Then, the "interference" in the 
finite-field domain can be completely eliminated by distributed zero-forcing precoding, provided that the 
conditions of Theorem [T] are satisfied. In order to achieve different coding rates while preserving the lattice 
Z[j]-module structure, we use n family of nested lattices A C A2 C Ai, where A^ = p^^g((Cfc)T + A with 
A = Z"[j]T and where Ck denotes the linear code over ¥q generated by the first r^ rows of a generator 
matrix G, with r2 < ri. The corresponding nested lattice codes are given by Ck = A^ n Va, and have 
rate R^ = — logg. We let B = [bi, b2] G Z[j]^^^, where b^ denotes the integer coefficients vector used 
at receiver k for the modulo-A receiver mapping (see a4p, and we let Q = [B^]q G F^^^. For the time 
being, it is assumed that det(Q), qn, (721 7^ over Fg. PCoF proceeds as follows: 

• Transmitters 1 and 2 produce the precoded messages: 

U]^ = miyv^ © 1712^2 (29) 

U2 = m3(wi©W2), (30) 

respectively, where m = (mi, m2, ms) is a non-zero vector m G Null(C) where C is related to Q 
as defined in ( |2T] ). 

• Each transmitter k produces the lattice codeword v^ = /(uyr,) G £1 (the densest lattice code) and 
transmits the channel inputs x^ = [v^. + d^] mod A, where d^ are dithering sequences. 

By lattice linearity we have: 

Yi = g{mi)ti+gim2)t2 mod A (31) 

Y2 = 9{ms)ti + 9{ms)t2 mod A (32) 

where t^ = /(w^). As in the proof of Theorem [T| we choose the precoding vector m = {'mi,m2,ms) 
to satisfy Condition 2, such that 

QM = diag(Aii, A22) for some An, A22 / (33) 
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where M is related to m as defined in S23\. 



Each receiver k appUed the CoF receiver mapping (|4]) with integer coefficients vector b^ and (scalar) 
scaling factor q^, yielding 



b^ff(M) 



+ Zeff(hfe,bfc,afc) 



mod A 



h 



+ Zeff(hA;,bfc,aA:) 



mod A 



(a) 



(b) 



{[h^gm] modpZ[. 






+ z,ff(hfc,bfc,afc) 



mod A 



(34) 



(35) 



(36) 



b(Afcfc)tfc + z^(f(hfc,bfc,ajt)] mod A (37) 

where h^ = [/ifci, hk2], where (a) follows from the fact that [pt\ mod A = for any codeword t G Ck, 
and where (b) is due to the following result: 

Lemma 2: Let Q = [B'"']^. If QM = diag(Aii, A22) over ¥q, then 

[bH5(M)] modpZ[i]=diag(g(Aii),<7(A22)). (38) 

Proof: Using [B*^] mod pZ[j] = ^(Q), we have: 

[B^c/(M)] modpZ[i] = [([B^] mod pZ[j])c/(M)] mod pZ[j] (39) 

= b(Q)5(M)] modpZ[i] (40) 

= b(QM)] modpZ[j] (41) 

= b(diag(Aii,A22))] modpZ[j] (42) 

= diag(r7(Aii),g(A22)) (43) 



From the results summarized in Section II-B we know that lattice decoding applied to the observation 
y at each receiver k can reliably decode the desired message if 

„ ., ^/ SNR \ 

Ru < loe 



ybH(SNR-il + hfchH)-ibfcy ■ 
This rate can be improved if each transmitter k scales its signal by some factor /3fc G V, where V = 
{/3 G C : |/3| < 1} denotes the unit disk in C. This choice of Pk guarantees that the power constraint is 
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Fig. 4. Encoding and decoding structures of the proposed achievability scheme. Transmitter 1 uses the DPC to cancel the 
interference at its intended receiver 1, and also performs precoding over finite-field to eliminate the interference at receiver 2. 



satisfied at each transmitter. The effective channel matrix induced by this scaling is given by 



(44) 



^ /3l/l21 P2h22 

We have: 

Theorem 2: Scaled PCoF applied to Gaussian Network-Coded CIC with H = [hij] G C^^^ achieves 
the rate pairs (i?i,i?2) such that 

. + ( SNR \ 



bH(SNR-il + hfchH)-ibfcy ' 
for any full rank integer matrix B = [bi,b2] with 611,621 / mod pZ[j] and /3i,/32 € V, where 
hfc = [/3i/ifci, ^2/1^2]- ■ 

C. An achievable rate region for the Gaussian Network-Coded CIC 



It was shown in Section III that distributed zero-forcing precoding is optimal for finite-field Network- 



Coded CIC. In the Gaussian case, however, the channel coefficients are not integers and hence Scaled 
PCoF may not be optimal due to the non-integer penalty. Using the fact that transmitter 1 has non-causal 
information of message 2, we can completely eliminate the interference of signal from transmitter 2 at 



receiver 1 by using DPC |31 1. Also, we can remove the non-integer penalty at the receiver 2 by using 
Scaled PCoF with a careful choice of the scaling factor of transmitter 2. In other words, while Scaled 
PCoF cannot simultaneously remove the non-integer penalty at both receivers, it can completely eliminate 
it at receiver 2, while interference at receiver 1 is handled by DPC precoding. 

We let b = [61,62] € ^[j]^ denote the integer coefficients vector used at receiver 2 for the CoF 
receiver mapping ^, and we let q^ = [hk\q. Again, it is assumed that gi, 52 7^ over Fg. The proposed 
achievability scheme proceeds as follows (see Fig. |4]): 
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Transmitter 2 produces the lattice codeword V2 = /(w^ © W2) and produces the channel input with 
power scaling factor P eV: 

X2 = /3x2 (45) 

where X2 = [v2 + d2] mod A. 

Transmitter 1 produces the precoded message ttiw^^ where m € Fq is given by 

qimeq2 = 0^m={qi)-^{-q2), (46) 

where (gi)"^ denotes the multiplicative inverse of qi and (—92) denotes the additive inverse of q2. 
Then, it uses DPC for the known interference signal /112X2 and forms: 

xi = [Yi - ai(/ii2//iii)x2 + di] mod A, (47) 

where \_i = f{mwi). 



By lattice linearity we have: 



g{m)ti mod A (48) 

1 1 + t2 mod A (49) 



where t^^ = /(w^^) and tg = /(wg). Receivers 1 and 2 observe the y and y given in (111 and (12 1, 
respectively. Receiver 1 performs the inflated modulo-lattice mapping as y = [aiy /hu — d^^] mod A. 
This results in the mod- A additive noise channel given by: 

ti = [(ai/^11) [^11X1 + ^12X2 + zi] -di] mod A (50) 

= [vi — vi + aiXi + ai(/ii2//iii)x2 + (ai/^ii)zi - d]^] mod A (51) 

= [vi — (1 — ai)ui + {cii/hii)zi] mod A (52) 

where u^^ is uniformly distributed on Va and is independent of z^ and v^ by the independence and 



uniformity of dithering and by the Crypto Lemma. From standard DPC results |37|, choosing 



l + SNR|/iii 
the coding rate Ri is achievable if 



A SNR|/in|2 

ai — «1,MMSE — , , rMnl, 19' (-'-'-' 



-Ri <log(l + |/iiipSNR). (54) 
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Letting h(/3) = [/i2i,/3^22] with /122 = ^22 — ai,MMSE^i2/i2i/^ii> receiver 2 applies the CoF receiver 
mapping ^ with integer coefficients b and scaling factor a2 = fei/^21. yielding 



[a2y, — bidi ~ ^2d2] mod A 



(55) 



[^lYi + 62Y2 + a2(/i2ixi + ^22X2 + Z2) - 61 [vj^ + di] - 62 [Y2 + d2]] mod A (56) 

[^lYi + 62Y2 + ("2/121 - &i)[Xi + di] + («2/3^22 - &2)x2 + a2/i2iA + "222] ™od A(57) 



(a) 



+ ibi(3h22/h2i - 62)112 + (6l/^2l)z2 



mod A 



c/(m) 
1 1 



mod p1^[j] 



ti 



+ z.„(h(/3),b) 



mod A 



(b) 



[([62] mod pZ[j])t2 + z^ff(h(/3), b)] mod A 



(58) 



(59) 



(60) 



where U2 is uniformly distributed on Va and is independent of v^^, V2, and Z2 by the independence and 
uniformity of dithering and by the Crypto Lemma, A = Qa{Xi — ai,MMSE/3(^i2//'^ii)x2 + di), (a) is due 
to the fact that 02/121 A = &iA G A, and (6) follows from the fact that m is chosen to satisfy the ( |46] ), 
i.e., big{m) + 62 mod pZ[j] = 0. Furthermore, we define 

Z^„(h(/3), b) = (5l/3/l22//l21 - f'2)u2 + (6l//l2l)z2- (61) 

Receiver 2 decodes t2 by applying lattice decoding to y^- This yields the achievable region given by: 

Theorem 3: Scaled PCoF and DPC applied to Gaussian Network-Coded CIC with H = [hij] G C^""^ 
achieves the rate pairs (i?i,i?2) such that 



Ri < log(l + |/iii|2SNR) 
R . 1 ^. SNR 

/t2 < log 



<rUP) 



(62) 
(63) 



for any b G Z[i]^ with 6i, 62 / mod p'L[j] and any /3 G "P, where 



^e'.(/3) 



01- 62 



i21 



SNR + 



61 



'-21 



Example 1: We evaluate the performance of proposed schemes with respect to their average achievable 
sum rate, where averaging is with respect to the channel realizations with i.i.d. coefficients hij ~ 
CJ\f{0, 1). Also, we considered the performance of full-cooperation (i.e., vector broadcast channel with 



sum-power constraint (see for example 1 38 1 for an efficient algorithm to compute the vector broadcast 
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- Full Cooperation (Vector BC) 
-• — Scaled PCoF and DPC 
-e — Scaled PCoF 




Fig. 5. Average sum rate for Gaussian Network-Coded CIC with i.i.d. cliannel coefficients ^ CA/'(0, 1). 

channel sum-capacity). In Fig. [5] Scaled PCoF shows the satisfactory performance in the moderate SNRs 
(i.e., SNR < 20 dB). Yet, this scheme suffers from the non-integer penalty at high SNRs. Remarkably, 
Scaled PCoF with DPC (and optimization with respect to the scaling factor f3 in Theorem [3]l performs 
within a constant gap with respect to full-cooperation at any SNR. 



D. Generalized Degrees of Freedom 

In the high SNR regime, a useful proxy for the performance of wireless networks is provided by the 
Generalized Degrees-of-Freedom (GDoFs), which characterize the capacity pre-log factor in different 
relative scaling regimes of the channel coefficients, as SNR grows to infinity |[l|. In this section we study 
the symmetric GDoFs. In particular, we consider the following channel model: 



/iiiVSNRx^ + /ii2\/lNRx2 + z^ 



/i2i\/lNRxi + /122VSNRX2 + Z2 



(64) 
(65) 
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where hij G C ai^e bounded non-zero constants independent of SNR, INR, Z/^ is the i.i.d. Gaussian noise 
~ C7V(0,1), and iE[||xfc||2] < 1 for A; = 1,2. The channel is parameterized by SNR and INR, both 
growing to infinity such that INR = SNR'' as SNR—)- cx), where p > defines the relative strength of 
the direct and interference paths. 

Letting Csum(SNR, p) denote the sum capacity for given SNR and p, the sum symmetric GDoF is 
defined by 

Cs„n,(SNR,p) 

d,,m{p) = lim — - — — — — . (66) 

SNR^oo log SNR 

The main result of this section is given by: 

Theorem 4: For the Gaussian Network-Coded CIC, the sum symmetric GDoF is given by 



4,m(p) = 1 + p. 



(67) 



d,um{p) 



Proof: See Appendix [A] ■ 

In order to demonstrate the benefit gain of the mixed message at the non-cognitive transmitter, we 

compare the sum GDoF of Gaussian IC and Gaussian CIC. The GDoF of Gaussian IC is computed in 

|[T|, and it is given by 

' 2(1 -p), 0<p<i 

2p, \<P<1 

2-p, i < P < 1 (68) 

p, 1 < p < 2 

2, p>2. 

Also, from the constant gap result in [20] , we can immediately compute the sum GDoF of Gaussian CIC 

as 

S - p, p < 1 

p, p>l. 

The sum symmetric GDoF of these three channel models are shown in Fig. |2] 

It is also immediate to observe that the sum GDoF of full-cooperation is given by 



dsum(/5) 



(69) 



d,unXp) = 2 X max{l,p}. 



(70) 



In this case, the upper bound can be obtained from the 2x2 MIMO capacity with full CSI, and an easily 
analyzable achievable scheme consists of employing simple linear precoding given by BH^^, where 



B 



1 
1 



for p < 1 and B 



1 

1 



for p > 1. 



(71) 
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Fig. 6. Two-User Gaussian networks with coordination, cognition, and two hops. 



Observing that (67i and (70 1 coincide for p = 1, we conclude that the sum DoF of the Network-Coded 
CIC coincides with the sum DoF of full-cooperation, while the sum GDoF is strictly worse than full 
cooperation when p / 1. Furthermore, the network-coded cognition yields higher sum GDoFs than the 
conventional cognition when p > 1/2 and higher sum GDoFs than the standard IC when p > 1/3. 

Remark 3: Apparently, having a rank-1 linear combination of both messages at transmitter 2 instead 
of just message 2 hurts for small p (weak interference) and it is helpful in the intermediate to strong 
interference regime. Obviously, in a system where the backhaul network is rate-constrained but can be 
optimized with respect to the employed network code used, one would dispatch to the non-cognitive 
transmitter its own message only if the wireless segment operates in the regime of weak interference, 
and a linear combination of the two messages if it operates in the medium or strong interference regimes, 
thus obtaining the upper envelope of the conventional and Network-Coded CIC sum GDoF. 



IV. Two-User MIMO IC: Coordination, Cognition, Two-Hop 

In this section, we study three communication channels (see Fig. [6|l with the two-user MIMO IC 
as a building block, namely, Network-Coded ICC (representative of a cellular system downlink with 



interference coordination), Network-Coded CIC (the MIMO generalization of the model of Section III 1, 



and 2 X 2 X 2 IC (a canonical two-flows two-hop network that has attracted considerable attention in recent 
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literature |22|, |26|). In all these models, we assume that all nodes have M transmit/receive antennas. 
Let w^j £ G Fg denote the independent messages intended for destination k, for /c = 1,2. For simplicity 
of exposition, we define the message matrix Wjr. with rows w^ i, • • • , w^ ^^, where W;, ^ can be all-zero 
vectors for ^ > S'fc if user k has Sk independent information messages. 

In the Network-Coded ICC, the source has no knowledge of the CSI and can deliver fixed (i.e., not 
dependent on the wireless channel matrices) linear combinations of the information messages to each 
transmitter, such that each transmitter k knows M linear combinations as Sfci W]^ ® Sfc2W2, for suitable 
integer matrices S^j. In the wireless channel, a block of n channel uses of the discrete-time complex 
baseband MIMO IC is described by 





Xi 




Zi 






+ 






LX2 J 




U2 J 



(72) 



Xi _ Fii F12 

X2 F21 F22 

where the matrices X^, and Y^, contain, arranged by rows, the channel input sequences Xfe £ ^ C^^", the 
channel output sequences y, ^ G C^^", and where Fjfc G ([^MxH ^q^q^q^ t^e channel matrix between 



transmitter k and receiver j. The Network-Coded CIC has the wireless channel component given in (72 1, 



but in this case the two transmitters have different knowledge on the messages. In particular, transmitter 1 
(the cognitive transmitter) knows both messages W]^ , W 9 and transmitter 2 (the non-cognitive transmitter) 
only knows linear combinations S2iW]^ © 892 Wq, where the rank of the linear combinations is not 
sufficient to recover the individual messages. Finally, we consider the 2x2x2 IC, as shown in Fig. [6] 
(c), where each transmitter k (referred to as "source" in this relay setting) has a message for its intended 
destination k, ior k = 1, 2. In this model, the first hop is also described by ( [72] ) and in the second hop 
a block of n channel uses of the discrete-time complex MIMO IC is described by 





X3 




z, 






+ 






X, 




z, 



(73) 



F33 F34 

F43 F44 

where we denote the two transmitter-receiver pairs in the second hop by /c = 3, 4, and where Z^ contains 
i.i.d. Gaussian noise samples ~ CJ\f{0, 1). We assume that the elements of Fjk are drawn i.i.d. according 
to a continuous distribution (i.e., Gaussian distribution). The channel matrices are assumed to be constant 
over the whole block of length n and known to all nodes, and we consider a total power constraint equal 
to -Psum at each transmitter (both sources and relays). 

Before stating the main results of this section, it is useful to introduce the following notation. With 
reference to Section II-B for a set of modulo- A additive noise channel of the type (|4]l, induced by nested 
lattice coding, by the channel matrix H^Cfc and by the integer combining matrix Bjt with columns h^/, 
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for k = 1,2 and £ = 1, . . . , L^, for some integer Lk, we define 



i2comp(HfcCA..,Bfc,SNR) = min < log 



SNR 



l,...,Lfc 



a 



eff,kl 



where 



„2 



Also, we define the constant matrices: 



n2 



■^22 



Olx(M-l) 
[(Af-l)x(M-l) 



'■(M-l)x(A'/-l) 







Ix(Af-l) 



(74) 



(75) 



(76) 



(77) 



With this notation, we have: 

Theorem 5: For the Network-Coded ICC and Network-Coded CIC, PCoF with CIA can achieve the 
symmetric sum rate of (2M — 1)R with all messages of the same rate given by 



R= min {i?comp(HfcCfc,Bfc,SNR)} 

k=l,2 ^ ^^ 



(78) 



for any full-rank integer matrices Ai e Z[j]*^^^, As G Z[j](*^-i)^(^^-i) and Bi,B2 € Z[j]*^^^'^ and 



any alignment precoding matrices V^ satisfying the alignment conditions in (87i, where 



Hfe 
SNR 



FfciVi, 



mm 



' fc Afc A^- V ^■ 



Ai Cfc2A2 



fc=i72 I V.A.AHvH 



(79) 
(80) 



Theorem 6: For the 2x2x2 IC, PCoF with CIA can achieve the symmetric sum rate of (2M — \)R 
with all messages of the same rate given by 

i?= jmin{iicomp(HfcCfc,Bfc,SNR)}, min{i?comp(HfcCfc,Bfc, SNR')}) , (81) 

for any full-rank integer matrices Ai,A3 G Z[j]^-^^*^ As, A4 G ^[j]{Af-i)x(M-i) ^j^^ g^ ^ 
Z[j]*^^^^, A; = 1, ... ,4, and any alignment precoding matrices Vjt satisfying the alignment conditions 
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in dSTll, where 



Hfc 
Hfc 

SNR 
SNR' 



FfciVi, Cj, 



min 



K, 



Ai CfcoA 



fe=l,2lVfcAfcAHVH 



min 



P«, 



fe=3,4 I VfcAfcAHv" 



fc2A2 



A, C 



(fc-2)2-^4 



/c = 1,2 
, A; = 3, 4 



(82) 
(83) 

(84) 
(85) 



The next result shows that the per- message rate R grows as log SNR when Psum — ^ oo, thus obtaining 
an achievable sum DoF result for all the above channel models: 

Corollary 1: PCoF with CIA achieves sum DoF equal to (2M - 1) for the Network-Coded ICC, 
Network-Coded CIC, and 2x2x2 IC, when all nodes have M multiple antennas. 

Proof: See Appendix [B] ■ 

For the Network-Coded CIC, we can improve the DoF by appropriately combining the DPC and PCoF 



as done in Section III-C for single antenna case. Exploiting this idea, we obtain: 

Theorem 7: For the Network-Coded CIC, the sum DoF is equal to M when all nodes have M multiple 
antennas. 

Proof: See Appendix |C] ■ 

The proofs of Theorems [5] and [6] are provided in Sections IV-A and IV-B Our achievable scheme is 
based on the extension of the PCoF approach to the MIMO case. This scheme consists of two phases: 
1) Using the CoF framework, we transform the two-user MIMO IC into a deterministic finite-field 
IC. 2) A linear precoding scheme is used over finite-field to eliminate the interferences (see Figs. [7]). 
The main performance bottleneck of CoF consists of the non-integer penalty, which ultimately limits 
the performance of CoF at high SNR [39|. To overcome this bottleneck, we employ CIA in order to 
create an "aligned" channel matrix for which exact integer forcing is possible, similarly to what done in 
Section UPbI 

Specifically, we use alignment precoding matrices Vi and V2 at the two transmitters such that 



HtC 



k^ki 



(86) 



FfciVi Ffc2V2 

where 'H.k £ C"^ ^"^ and Cjt G Z[j]*^^^^*^. Linear precoding over the complex field may produce a power- 
penalty due to the non-unitary nature of the alignment matrices, and this can degrade the performance 



->MxM 



at finite SNR. In order to counter this effect, we use Integer Forcing Beamforming (IFB) |12|. The main 
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idea is that V^ can be pre-multiplied (from the left) by some appropriately chosen full-rank integer 
matrix A^ since its effect can be undone by precoding over Fg, using [A^Jg. Then, we can optimize 
the integer matrix in order to minimize the power penalty of alignment. The optimization of alignment 



and IFB in order to obtain good finite SNR performance is postponed to Section VI The details of the 
coding scheme are given in the following sections. 

A. CoF Framework based on Channel Integer Alignment 

In this section we show how to turn any two-user MIMO IC into a deterministic finite-field IC using 



the CoF framework. Consider the MIMO IC in (72). For A: = 1, 2, let W^^ = S^iW^ Sfc2W2 denote 
the network coded messages at transmitter k. We let w^^, ^ G F^ denote the £-th row of W^^, and 
we let W.,,^, W^ have dimension M x r and W^^, Wg have dimension (M — 1) x r. The precoding 



matrices S^i, 8^2 over ¥q will be determined in Section IV-B We let Vi = [vi i, . . . , vi^m] G C ^ 



and V2 = [v2,i, . . . , V2,M-i] € C^^^*-*^ ^' denote the precoding matrices used at transmitters 1 and 2, 
respectively, chosen to satisfy the alignment conditions 

F11V1/+1 = F12V2/ 

F21V1/ = F22V2/ (87) 



for £ = 1, . . . , M — 1. The feasibility of conditions ([87]) is shown in p6[ for any integer M > 2, almost 
surely with respect to the continuously distributed channel matrices {Fj^}. 

Let Ai G Z[j]^^^^^ and A2 G Z[j](^"^)^(*^"^) denote full rank integer matrices (the optimization 



of which in order to minimize the transmit power penalty is discussed in Section VI). The transmitters 
make use of the same lattice code C of rate R, where A is chosen such that u^ = SNR. Then, CoF 
based on CIA proceeds as follows. 
Encoding: 

• Each transmitter k precodes its messages over Fg as 

W;^ = [Afc]giW,^, A; = 1,2. (88) 

Then, the precoded messages (rows of W^^) are encoded using the nested lattice codes as t'^^ = 
/(w^^ J. Finally, the channel input sequences are given by the rows of 

X'fc = VfcAfcX'fc, (89) 

where X'^^. has rows x'^ ^ = [t'^ ^ + d^_^] mod A. 
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Due to the sum-power constraint equal to Psum at each transmitter, the second moment of coarse lattice 
(i.e., SNR) must satisfy 



HArH 



SNR • tr(VfcAfcA^Vn < Psum for k = 1,2. 



Thus, we can choose: 

Decoding: 

• Receiver 1 observes: 



SNR 



mm 



tr(VfcAfcAHvH) 



: k = l,2} . 



(90) 



(91) 



(a) 



FiiXi + F12X2 + Zli 
Fiiyi_[ ImxM C12 ] 



AiX'i 

A2X2 



+ Zi 



HiCi 



X'2 



+ Zi 



(92) 
(93) 

(94) 



where (a) follows from the fact that the precoding vectors satisfy the alignment conditions in ( 87 1 

and Ci = [ Ai C12A2 ]■ 

Similarly, receiver 2 observes the aligned signals: 



^21X1 + F22X2 "I" ^2 
F2lVi[ ImxM C22 ] 

'x'l 

X'2 



AiX'i 

A2X'2 



+ Zn 



H2C2 



+ Zn 



(95) 
(96) 

(97) 



where C2 = [ Ai C22A2 ]■ 
Notice that the channel matrices in (94i and ([97]) follow the particular form in ([2]). Following the 
CoF framework in ^ and (lOl, if i? < i?comp(HfcCfc, B^, SNR), receiver k can decode the M linear 
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combinations with full-rank integer coefficients matrix B^: 

W' 



u. 



[B%[C 



k\q 



Wl 



■T2 



[B 



k\q 



W r-oHi 

- [^k\q 



[Al], [Cfc2]g[A2], 



w: 



Ti 



Wi 



T2 



i-MxM 



^k2\q 



w 
w 



■Ti 



■T2 



(98) 



(99) 



(100) 



where (a) is due to the precoding over Fg in (88). Let W ^^ = [Bj^jJ^U^ and W ^^ denote the first 
M — 1 rows of [B2]q^^U2- The mapping between {W^^, W^^} and {W^^, W^^} defines a deterministic 



finite-field IC given by: 



where the system matrix is defined by 



W 



Ti 



w 



T2 



Q 



sys 



w 



■Ti 



w 



-T2 



(101) 



Q 



sys 



Ia/xM Qi2 

Q2I I(M-l)x(M-l) 



and where 



Q 



12 



Olx(M-l) 
[(J\/-l)x(A/-l) 



Q 



21 



I(A/-l)x(M-l) 0(Af-l)xl 



(102) 



(103) 



Notice that the system matrix is fixed and independent of the channel matrices, since it is determined 
only by the alignment conditions. 



B. Linear Precoding over deterministic networks 

In this section we determine linear precoding schemes to eliminate the interferences in the finite-field 
domain. Recall that transmitter 2 sends only M — 1 messages in order to use CIA. Accordingly, Qgyg in 



( 101 1 has dimension (2M - 1) x (2M - 1). 



1) Network-Coded ICC: In this model, the source can deliver linear combinations of information 
messages with coefficients Qsyg. We have: 

Lemma 3: The system matrix Qsys defined in (102i is full-rank over ¥q. 
Proof: The determinant of Qsys is given by 



det(Q; 



sys; 



det(lMxM)det(I(M-i)x(A/-i) © (-Q21Q12)) 
det(I(M-i)x(A/f-i) © (-Q21Q12)) = 1, 



(104) 
(105) 
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Source 1 



Relay 1 



Destination! 



Wl^l 



«'1.2 



■"'2.1 




W^l.l 
«;i,l ffi (W)l,2 ©«'2,l) 



preceding 



Source 2 



-(wi,i ffi 102,: 



Relay 2 




Destination 2 



Fig. 7. A deterministic noiseless 2x2x2 finite-field IC. 



since I 



-Q21Q12) is a lower triangular matrix with unit diagonal elements. 



(M-l)x(M-l) ^ 

Such precoding yields immediately W.^^ = Vi f, for /c = 1,2. This proves Theorem 5 for the Network- 
Coded ICC. 

2) 2x2x2 IC: We use the CoF framework based on CIA illustrated in Section |IV-A| in order to 
turn each hop (i.e., a two-user MIMO IC) into a deterministic finite-field IC defined by Qsys in (102i. 
At the two sources, no precoding is used such that W ^^ = W^., for k = 1,2. Hence, the deterministic 
finite-field IC corresponding to the first-hop of the 2 x 2 x 2 IC network has outputs W ^ , W^ related 



to Wi and W2 by ( [TOT) . 

Relays 1 and 2 perform precoding of the decoded linear combination messages such as W ^„ = Mi W^^ 



and W ^ = M2 W^ , where the precoding matrices Mi and M2 are defined in Lemma 4 Operating in 



a similar way as for the first hop, the second hop deterministic finite-field IC is given by 



W 



■T3 



Q 



sys 



w 
w 



■T3 



-T4 



(106) 



Concatenating the two hops, the end-to-end finite-field deterministic network is described by 



W 



T3 



w 



-T4 



Q 



sys 



Wi 



(107) 



Ml 

*«4sys 

M2 Wo 

Lemma [4] shows that the decoded linear combinations are equal to its desired messages at destination 1 
and are equal to the messages with a change of sign (multiplication by — 1 in the finite-field) at destination 
2 (see Fig. |7]l. This proved Theorem [6] 
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Lemma 4: Choosing precoding matrices Mi and M2 as 

Ml = (lMxMe(-Ql2Q2l))"^ 

M2 = -(I(M-l)x(M-l)®(-Q2lQl2))"' 

the end-to-end system matrix becomes a diagonal matrix: 



Q 



sys 



Ml 
M2 



Q 



sys 



MieQl2M2Q21 MiQi2eQl2M2 

Q2iMi©M2Q2i Q2iMiQi2©M2 



ImxA/ 








'■(M-l)x(M-l) 



Proof: From the Matrix Inversion Lemma [40, Thm 18.2.8], we can rewrite Mi and M2 as 



Ml 
M2 



"-AfxM 



Ql2(I(M-l)x(M-l) © (-Q21Q12)) Q2I 



'-(Af-l)x 



(A/-l)eQ2l(I 



MxM 



(-Ql2Q2l))-'Ql2) 



(108) 
(109) 

(110) 

(111) 

(112) 
(113) 



• Canceling the interferences: 

MiQi2©Ql2M2 = (lMxMe(-Ql2Q2l))"^Ql2eQ 



12(,-li(A/-l)x(Af-l) 



(-Q21Q12)) 



-l^ 



Q12 e Ql2(I(Af-l)x(Af-l) e (-Q2lQl2))"^Q2lQl2 

©Ql2(-(I(M_l)x(A/-l) e (-Q2lQl2))"^) 

Q12 e (-Ql2)(I(Af-l)x{Af-l) e (-Q2lQl2))"n(-Q2lQl2) e I(M-l)x{Af-l)) 







Afx(Af-l) 



Q2lMi©M2Q21 = Q2ieQ2lQl2(I(Af-l)x{Af-l)®(-Q2lQl2))"^Q21 

e(-(I(Af-l)x{Af-l) e (-Q2lQl2))"^Q2l) 



Q 



21 



'-(Af-l)x 



(Af-i) e (-Q2iQi2))(i(Af-i)x{A^-i) e (-Q21Q12)) ^(-Q2l) 







(A/-l)xAf- 



Preserving the desired signals: 

Mi©Qi2M2Q21 = lAfxA//eQl2(I{A.f-l)x(A/-l)©(-Q2lQl2))"^Q21 

©(-Ql2(I(A/-l)x(Af-l) ® (-Q2lQl2))"^Q2l) = I 
Q2lMiQi2©M2 = Q2l(lA/xA/®(-Ql2Q2l))"^Ql2®(-I(Af-l)x(A/-l)) 



MxM 



)(-Q2l(I 



MxM 



-Ql2Q2l))"'Ql2 



'-(A/-l)x(A/-l) 
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This completes the proof. 



C. Network-Coded CIC 



In this case we assume that transmitter 1 knows both messages W^ and W2, and transmitter 2 only 



knows M — 1 linear combinations SiW'^ © S^Wq, where Si, S2 G F, 



(M-l)x(M-l) 



are full-rank matrices 



and where W ^^ = Q21 W^^ contains the first M — 1 rows of W^^ (see the definition of Q21 in ( 103 1). In 
fact, we may assume that transmitter 2 also knows the interference-free message w^ ^^ (the last row of 
W^) but this is not used in our scheme. Transmitters 1 and 2 perform the precoding (over Fg) in the 
following way: 



W 



■Ti 



w 



T2 



Mi(Wi©Qi2S^iS2W2 
M2S^i(SiW;©S2W2) 



where Mi and M2 are defined in Lemma |4j From ( |10ip , we have: 



W, 



■Ti 



W, 



■T2 



Q 



sys 



^sys 
*«4sys 



Ml 




(a) 



Ml 



jMxAf 







M2 



M2 





tMxM /-» _c-i 



Q 



sys 



■Si S2 



Q12S1 S2 

Q21 S^ S2 

tM X a/ q 

sr^s2 

Wi 

Wo 



w 
w 



2 

Wi 

W2 



(114) 
(115) 

(116) 
(117) 
(118) 
(119) 



where (a) follows from Lemma |4] This shows that the decoded linear combinations are equal to its desired 
messages at receiver 1 and are equal to the messages with multiplication by full-rank matrix (— S^^S2)~^ 
(in the finite-field domain) at receiver 2. Based on this. Theorem [5] is proved for the Network-Coded 
CIC. 



V. Improving the sum rates using successive cancellation 

In this section we improve the sum rate in Theorem [5] by using CoF with successive cancellation. We 
focus on Network-Coded ICC to explain the proposed scheme. As shown before, precoding of information 
messages (over Fg) can eliminate interference from the other transmitter so that each receiver observes 
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full-rank integer linear combinations of its own intended lattice codewords in the corresponding MIMO 
modulo A channel. Once the network is reduced to two decoupled MIMO modulo A channels, each 
receiver can perform successive cancellation following the idea first proposed in |]4T|. In this way, 
each message can be recovered reliably at rate equal to the computation rate of the corresponding 
equation, without being constrained by the equal rate requirement (minimum of the computation rates of 
all equations). 

In order to achieve the different coding rates while preserving the lattice Z[j] -module structure, we 
use a family of nested lattice codes A C A2M-1 ^ ••• ^ Ai, obtained by a nested construction A 



as described in |[33| Sect. IV.A]. In particular, we let A^ = p^^g{Ce)T + A with A = Z"[j]T and 
with Cc denoting the linear code over F^ generated by the first r^ rows of a common generator matrix 
G, with r2M-i < • • • < ^1- The corresponding nested lattice codes are given by Ci = Ai Ci V\ for 
£ = !,..., 2M — 1. Let W;, £ G F!|^ be the zero-padded message to the common length ri. 



Encoding follows the same procedure outlined in Section IV Namely, we let 



W 



■Ti 



w 



T2 



^svs 



sys 



Wi 



Wn 



(120) 



in order to eliminate interference. Recall that each transmitter k precedes its messages over Fg as in ( 88 1 



where the integer matrix A^ is used for IFB with the purpose of minimizing the power penalty (see 
later). Then, the precoded messages are encoded using the densest lattice code £1 as t^^i. = /(w^^ J. 
Finally, the channel input sequences are given by the rows of X'^' defined in (89 1. Let t/.^ 



/(m 



denote the lattice codeword corresponding to information message w^ ^. Using lattice linearity, we can 
express the preceding in the complex (lattice) domain as: 



t; 

T' 









-^2 



giQsyl 



sys J 



Xi 



mod A. 



(121) 



From ( [94| ) and ( [97] ), each receiver k observes the integer aligned signals: 



HfcCjt 



X'o 



Hi 



i-MxM 



+ z, 



>fc2 



Ai 
A2 



X'l 

x; 



+ Z, 



(122) 



(123) 



The modulo A vector channel after applying the CoF receiver mapping (|4]) with integer coefficients matrix 
Bfc seen at each receiver A; = 1, 2 is given as follows: 
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At receiver 1 we have: 



Xi 



bH 



Ia/xM Ci2 



Ai 
A2 



T' 



+ Z^ff(HiCi,Bi) 



mod A 



(a) 



(b) 



rH 



t-MxM '->12 



^(Qsys 

BiT^ + Z^ff(HiCi,Bi 



+ Z_„(HiCi,Bi 



mod A 



mod A 



where (a) follows the (121 1, (6) is due to the fact that 



i-MxM '->12 



tsys 



MxM QMxAf-1 



(124) 
(125) 

(126) 



and where Z^ff(HiCi,Bi) denotes the M x n matrix of effective noises with rows 
z^ff(HiCi, bi £, Qi f), and the projection vector cti^i is determined as a function of HiCi,bi^f 
as said in Section III-BI 
Similarly, at receiver 2 we have: 



(a) 



bH 



bH 



B 



t-MxM '->22 



l-AfxM '->22 



Ai 
A2 



T' 



+ ZJH2C2,B2) 



mod A 



5(Qs;s) 



+ Z „(H2C2,B 



A 



T2 



+ Z,„(H2C2,B2) 



mod A. 



(127) 



(128) 



where t denotes some linear combination of lattice codewords, irrelevant for receiver 2, (o) follows 
from the fact that 



Ia/xA/ C22 



Q 



1 

sys 



qA/-1xA/-1 j 



(Af-l)x(A/-l) 



and where • denotes some non-zero vector in F, 



lx(2A.f-l) 



(129) 



In (128 1, Z^„(H2C2,B2) is defined 



similarly to Z^ff(HiCi, Bi). Receiver 2 can recover its M — 1 messages as long as it has M — 1 
full-rank linear combinations of its own messages. In order to remove the unintended messages 
collected in t, we choose B2 in the form: 



B 



BH 



(130) 
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where B2 G Z[j](^^ i)x(Af 1) j^ full-rank. Then, the first M — 1 observations of receiver 2 is given 
by 



B2^T2 



Zeff(H2C2,B2 



mod A. 



(131) 



From (125 1 and (131 1, we have that each receiver obtains a full-rank interference-free MIMO integer 
valued modulo A channel with effective additive noise. At this point, each receiver can perform successive 



cancellation |41 1, thus relaxing the minimum common computation rate constraint. Focusing on receiver 

1, we illustrate the successive cancellation procedure with given integer matrix Bi and computation rates 
{log^(SNR/cr^^jj ^^) : £ = 1, . . . , M}. The same procedure can be straightforwardly applied to receiver 

2, given the formal equivalence of ( |125| ) and ( 131| ). Without loss of generality, assume that 



2 2 

^eff,l,l < • • • < '7cff,l,M- 



(132) 



Letting Ri^i denote the rate of ^-th message of user 1, we have Ri^£ = rj^ ^ for some ji £ € {1, . . . , 2M — 
1}, i.e., the ^-th message of user 1 is encoded using nested lattice codes Cj^ 1,. For the time being, 
we assume that i?i 1 > Ri^2 > • • • > Ri,ai (the ordering will be determined later on, according to 
column permutation of Bi that is required for successive cancellation). Receiver 1 can reliably decode 

Si = [bJ^j^T]^] mod A as long as 



Ri,i < log+ 



SNR 



^ff,i,i 



(133) 



Then, it proceeds to decode Sg = [bJ^gXi] mod A. Using the previously decoded s^, it can perform the 
cancellation: 



[y, + 62181] mod A 



[§.2 +62181 +z^„(HiCi,bi,2,ai,2)] mod A 
[§2 +Zeff(HiCi,bi,2,ai,2)] mod A. 



(134) 
(135) 



Here, 621 G Z[i] is chosen so that [(bi^2(l) + e2ibi^i(l))] mod pZ[j] = where b(j) denotes the j-th 
element of vector b. Then §2 does not include t^ and hence receiver 1 can reliably decode §2 as long as 

' SNR \ 



Ri,2 < log^ 



(136) 



eff,l,2 , 



Now, receiver 1 can obtain S2 such as Sg = [§2 — 62181] mod A. Receiver 1 can decode the remaining 
linear combinations s^ for ^ > 3 in a similar manner. Namely, before decoding s^, receiver 1 adds 



■sr^^-l 
Z^7 = l 



mod A (i.e., an integer valued linear combinations of previously decoded s 's). Here the 



coefficients e£j are chosen so that the impact of ti, . . . , t£_i is canceled out from s^. Assuming that such 



34 



coefficients exist, receiver 1 can decode §« 



mod A as long as Ri^i is less than the 



corresponding computation rate of the £-th equation. 

From |]4n Lemma 2], such cancellation coefficients exist for at least one column permutation vector 
TTi of Bi- Accordingly, all M linear combinations can be decoded as long as 

'snr' 



Ri,m{i) < log^ 



for £ = l, 



M. 



a 



(137) 



eff,!,^ 



M 



Therefore, the sum rate X]£=i ^^S 



SNR 



is achievable. Similarly, there exists at least one column 



permutation vector 7r2 of B2 for which all M — 1 linear combinations at receiver 2 can be decoded as 
long as 

for£ = l,...,M-l, (138) 



R2,n,ie) < log+ -2 



0" 



cff,2/^ 



where we let R2 . 



'32,1 



, for some index mapping J2,£ S {!,..., 2M — 1}, denote the rate of 



th message of user 2. The exactly same procedure can be applied to the Network-Coded CIC. The 
successive cancellation replaces the sum-rate formula in Theorem [5] as follows: 

Theorem 8: For the Network-Coded ICC and Network-Coded CIC, PCoF with CIA can achieve sum 
rate as 

^ ,/snr\ '^^ ,/snr\ 



a , 



for any full rank integer matrices Ai G Z[i]*^^*^A2 G z[i](^-^-i)^(*^-^), Bi G Z[j]*^^*^B2 G 



^^jj(Af i)x(A/ 1)^ ^^^ ^j^y alignment precoding matrices V^ to satisfy the alignment conditions in (87 1, 
where 





B2 - 


BH 








Hfc = ] 


FfciVi,Cfe = 


Ai Cfe2A2 




SNR = 


J ^sum 1 




2'5lV,A,A?VHJ' 


and where 











k = 1,2 



(140) 
(141) 

(142) 



cr; 



eB,k/ 



b" C,(SNR-il + C^HHHfcCfc)-iC^bfc, 



/c = 1,2. 
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VI. Optimization of achievable rates 



In this section we optimize the integer matrices A^ and B^ in Theorems 5][8 by assuming that the 



precoding matrices V^ are given. The dimensions of A^ and B^ can be either MxM or (Af — 1) x (Af — 1), 
depending on k. Since this does not change the optimization problem, we will drop the index k and just 
consider dimension M. The power-penalty optimization with respect to A takes on the form: 

M 

argmin tr ('vAA^V^) = ^ ||Va^f 

£=1 

subject to A is full rank over Z[j] (143) 

where a^ denotes the ^-th column of A. Also, the minimization of the effective noise variance with 
respect to B takes on the form: 

argmin max 1 1 1 Lb^ 1 1 } 
subject to B is full rank over Z[j] (144) 

where L denotes a square -root factor of (SNR~^I + H'~'H)^^, H denotes an aligned channel matrix and 
hg denotes the i-th column of B. 

We notice that problem ( 143| ) (resp., (144i) is equivalent to finding a reduced basis for the lattice 



generated by V (resp., L). In particular, the reduced basis takes on the form VU where U is a unimodular 
matrix over Z[i]. Hence, choosing A = U yields the minimum power-penalty subject to the full rank 
condition in ( 143 1. In practice we used the (complex) LLL algorithm [ |42j | q] with refinement of the LLL 



reduced basis approximation by Phost or Schnorr-Euchner lattice search | |44| . We let u^ denote the ^-th 
column of U. Phost or Schnorr-Euchner enumeration generates all non-zero lattice points in a sphere 
centered at the origin, with radius equal to d = max^ ||Vu£|p. This radius guarantees the existence of 
solutions because of having the trivial solution {ui, . . . , um}- Define the set of integer vectors such that 
the corresponding lattice points are in a sphere with radius d by 

{A, W) = {(a^, Wi) : Wi = II Va^f < d}. (145) 

The following lemma shows that the greedy algorithm (Algorithm 1) finds a solution (i.e., M linearly 



independent integer vectors) to the problem (143 1 (or ( 144i). 



We can also use the HKZ and Minkowski lattice basis reduction algorithm (see |43| for details). 



§ 
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Lemma 5: For given {A, W) defined in ( 145 1, Algorithm 1 finds a solution to the following two 
problems: 



mm 

ScA 



E 



Wi 



subject to {a^ : £ e S} are linearly independent 

151 = M. 



and 



min raaxiwe : i £ S} 
ScA ^ ' 



subject to {ai : £ e S} are linearly independent 

151 = M. 



(146) 

(147) 
(148) 

(149) 
(150) 
(151) 



Proof: The first problem consists of the minimization of linear function subject to a matroid 
constraint, where the matroid A^(ri,X) is defined by the ground set il = [1 : |^|] and by the collection 



of independent sets I = {S C ^ : {ai : i £ S} are linearly independent}. Rado and Edmonds |45 1, |46| 
proved that a greedy algorithm (Algorithm 1) finds an optimal solution. In case of the second problem, 
we provide a simple proof as follows. Suppose that the indices of elements in A are rearranged according 
to the increasing ordering of the weights we- The problem is then reduced to finding the minimum index 
i^ such that {ai, . . . , a^t} includes the M linearly independent vectors. This is precisely what Algorithm 
1 does. ■ 

Algorithm 1 The Greedy Algorithm 

Input: (^,$) = {(a^,^^) : a^ G Z[j]^'^''\wi G Z+} 

Output: ScA with \S\ = M 

1) Rearrange the indices of vectors in A such that wi < W2 < ■ ■ ■ < W|^| 

2) Initially, ^ = 1 and 5 = 

3) If Rank(5 U {£}) > Rank(5) then 5 ^ 5 U {£} 

4) Set^ = ^ + 1 

5) Repeat 3)-4) until \S\ = M 
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H Time Sharing 

PCoF witli CIA (w/o opt) 
-• — PCoF witli CIA 




P /M [dB] 
sum 



Fig. 8. Performance comparison of PCoF with CIA and time-sharing with respect to ergodic symmetric sum rates for 2 x 2 x 2 
MIMO interference channel with M = 2. 



A. Finite SNR Results 

We evaluate the performance of PCoF with CIA in terms of its average achievable sum rates. We 
computed the ergodic sum rates by Monte Carlo averaging with respect to the channel reaUzations 
with i.i.d. elements ~ CJ\f{0, 1). Recall that we consider a total power constraint equal to Psum at 
each transmitter (both sources and relays). We first consider the symmetric sum rates in Theorem |6] for 
2x2x2 Gaussian IC. For comparison, we considered the performance of time-sharing where IFR is 
used for each M x M MIMO IC. We used the IFR since it is known to almost achieve the performance 
of joint maximum likelihood receiver 1321 and has a similar complexity with PCoF. In this case, an 
achievable symmetric sum rate of time-sharing is obtained as 



i? = min{i?comp(Fii,Bi,SNR),i?comp(F33,B2,SNR)} 



(152) 
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100 



-B — Time Sharing (M=2) 
-♦ — PCoF with CIA (M=2) 
-^ — Time Sharing (M=3) 
-e — PCoF with CIA (M=3) 




P /M [dB] 
sum 



Fig. 9. Performance comparison of PCoF with CIA and time-sharing with respect to ergodic symmetric sum rates for MIMO 
interference coordination channel. 



for any full-rank matrices Bi,B2 € Z[j]^^^^^, where SNR = Psum- Here, we used 2Psum for power 
constraint since with time-sharing each transmitter is active on only half of the time slots. For PCoF with 



CIA, we need to find precoding matrices for satisfying the alignment condition in ( 87 1. For M = 2, the 
conditions are given by 

FiiVi^2 = Fi2V2,i and F2ivi^i = F22V2,i. 
For the simulation, we used the following precoding matrices to satisfy the above conditions: 



Vi 



F21 F22I F^-^ F12I 



and V2 



1. 



(153) 



Also, the same construction is used for the second hop. We used the complex LLL algorithm to optimize 
integer matrices, yielding lower bound on achievable rate in Theorem [6] Since source 1 (or relay 1) 
transmits one more stream than source 2 (or relay 2), the former always requires higher transmission 
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power. In order to efficiently satisfy the average power-constraint, the role of sources 1 and 2 (equivalently, 
relays 1 and 2) is alternatively reversed in successive time slots. In Fig. [8j we observe that PCoF with 
CIA can have the SNR gain about 5 dB by optimizing the integer matrices for IFR and IFB, comparing 
with simply using identity matrices. Also, PCoF with CIA provides a higher sum rate than time-sharing 
if SNR > 15 dB, and its gain over time-sharing increases with SNR, showing that in this case the DoF 
result matters also at finite SNR. 

In addition, we evaluate the performance of the Network-Coded ICC with respect to sum rates. The 
achievable sum rate is given in Theorem [8] For comparison, we considered the performance of time- 
sharing where the achievable sum rate is equal to the capacity of the individual (interference free) MIMO 
channel with full CSI at both transmitter and receiver, given by 



M 



Rsum = Xll°g(l + PM^kk^kk)) 



(154) 



where P^ is obtained via water-filling over the eigenvalues of F^^F^fc, denoted by Xi{F^j,Fkk), such as 

1 



Pf 



fJ- 



(155) 



with /i is chosen to satisfy the total power constraint X]f=i ^i — 2i^sum- Again, with time-sharing the 



per-slot power constraint is 2Psum- For PCoF with CIA, we used the same construction method in ( 153 1 
for M = 2. Also, we considered the case with Af = 3 and in this case, the alignment conditions are 
given by 



FllVi^2 = 


= Fi2V2,l 


FllVi,3 = 


= Fi2V2,2 


F2lVi,i = 


= F22V2,l 


F21V19 = 


= F22V2 2 



For the simulation, we used the following precoding matrices to satisfy the above conditions: 

Vi = 



21*22*12*11-'- -■- -^11 -T 12-r22 ■•^21-'- 



Vo 



F12F111 F22F211 



(156) 

(157) 



Fig. |9] shows that PCoF with CIA provides a higher sum rate than time-sharing, having a larger gap as 
SNR increases. 
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Appendix A 
Proof of Theorem|4] 

A. Achievable scheme 

We use the achievable rates given in Theorem [3] It is immediately shown that the achievable GDoF 
of message 1 (cognitive user), obtained by 

^'^ SNR^oo logSNR 

In this proof, we show that message 2 (non-cognitive user) achieves GDoF equal to p, by carefully 
choosing the power scaling factor /3 G "P. The effective channel for Scaled PCoF is given by h(/3) = 
[/i2i"\/INR, /3(/i22"\/SNR — ai_MMSE(/ii2^2i/^ii)SNR^^2)] and can be rewritten as 

h(/3) = SNR^/2[/i2i,/3/i22] (159) 

where ^22 = /i22SNR(i~^)/2 _ hSHR^P-^^/"^ and h = ai,MMSE(/ii2/i2i//iii). Here, we choose /3 = /3* = 
^21/(^227), where 7 > 1 is an integer with 7 = [|/i2i 7^2211 ^ ^+- This produces a kind of "aligned" 
channel: 

\i = SUR>''\h2i,h2ih]. (160) 



Letting 61 = 7, and 62 = 1. the effective noise in (61 1 is obtained by 



This shows that non-integer penalty is completely eliminated. Also, we can use the zero forcing precoding 
over Fq since the chosen integer coefficients 61 = 7 and 62 = 1 are non-zero. From this, we have the 
lower bound on the achievable rate of Scaled PCoF: 

maxi?2(/3) > i?2(/3*) = p\og{\h2i\^SHR) - 21og(7). (162) 

The lower and upper bounds on 7 is given by 

/121 



1 < 7 < 1 + 



(163) 



/i22SNR(i-'')/2-/iSNR(^-i)/2 
where 7 converges to a const as SNR — ;• 00. Finally, the achievable GDoF of the non-cognitive transmitter 
is derived as 

Mp) > lim !^^^f*l = p. (164) 



From ( 158 1 and ( 164 1, the achievable sum GDoF is 1 + p. 
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B. Converse 

For given rates Ri and R2, we define i?min = min{i?i, i?2} and R/^ = max{i?i,i?2} — ^min- If 
Ri > R2 then Wi = (Mi, Ma) and VF2 = (M2, 0). In the reverse case, we have that Wi = (Mi, 0) and 
W2 = (M2, Ma). In both cases, the non-cognitive transmitter knows the Unear combination, Wi © W2 = 
(Mi©M2, Ma). From the well-known Crypto Lemma, the Mi©M2 is mutually statistically independent 
of Ml, as well as Mi © M2 is mutually statistically independent of M2. In this proof, we derive the 
upper bounds on i?min and i?max = -Rmin + ^A. First, we derive the upper bound on the minimum rate 

-'tmin • 

ni^min = i?(Mi) = i/(Mi|Mi©M2,MA) (165) 

= H{Mi\Mi © M2, Ma) - i^(Mi|yi", Mi © M2, Ma) (166) 

+H{Mi\Y{', Ml © M2, Ma) (167) 

(a) 

< I {Ml ; Fi" I Ml © M2 , Ma ) + ne„ (168) 
= /i(yi"|Mi©M2,MA)-/i(yi"|Mi©M2,MA,Mi)+ne„ (169) 
^^ /i(yi"|X2",Mi©M2,MA)-/i(yi"|Xi",X^\Mi©M2,MA,Mi) + ne„ (170) 

< h{Y^\X^) - /i(yi"|Xf , X^\ Ml © M2, Ma, Mi) + ne„ (171) 
^^ h{Y{'\Xl^)-h{Y^\X'^,Xl^) + nen (172) 
= /(Xi";yi"|X2") + ne„ (173) 

< nlog(l + |/iiipSNR) + ne„ (174) 
where (a) follows from the Fano's inequality and data processing inequality as 

H{Mi\Y^,Mi © M2,Ma) < F(Mi|yi") < H{Mi\Mi) < ncn, (175) 

(b) follows from the fact that encoder 2 has (Mi © M2, Ma), therefore for any coding scheme X^ is a 
function of (Mi © M2, Ma), and (c) follows from the fact that there is a Markov chain 

(Mi,M2,MA)^(Xr,X^')^yi". 
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In the same manner, we get: 

nRrnin = H{M2) = H{M2\Mi®M2,M^) (176) 
= H{M2\Mi e M2, Ma) - H{M2\Y^, Mi M2, Ma) 

+H{M2\Y^,MieM2,MA) (177) 

< I{M2;Y^\Mi(BM2,MA)+nen (178) 

= I{X^;Y2^\X^) + nen (180) 

< nlog(l + |/i2iplNR) + ne„. (181) 



From ( |174[ ) and ( |181| ), we have 

i^min < min{log(l + |/iiipSNR),log(l + |/i2iplNR)}. (182) 

An obvious upper bound on Ri and R2 are given by 

nRi < I{X^,Xl^-Y^) + nen (183) 

< nlog(l + |/iii|2SNR + |/ii2|^INR) + nen (184) 
nR2 < IiX^,X^;Y^) + nen (185) 

< nlog(l + |/i2iplNR + |/i22pSNR) + nen. (186) 

Since Rmm + Ra = max{i?i, i?2}, we have: 

i^max < max{log(l + |/iii|2SNR + |/ii2plNR), 

log(l + |/i2i|^INR + |/i22|^SNR)}. (187) 



Using (182 1, (1871, and INR = SNR'', we have the upper bounds in the asymptotic case: 



lim 1; — "^T.r^ + ; — ™tTr^ I < minjl, p| + maxjl, p|, (188) 

SNR->oo VlogSNR logSNR; - ^ "^^ ^ "^^' ^ ^ 



yielding the upper bound on the sum symmetric GDoF as 



d,„„, = lim — — <l+ p. (189) 

SNR^oo logSNR 



This completes the proof. 
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Appendix B 
Proof of Corollary[T] 

For the DoF proof, we assume that -Psum goes to infinity and equivalently, SNR goes to infinity. In 
this proof, we will show that the individual messages rate R (equal for all messages) grows as log SNR, 
i.e.. 



lim 



R 



1. 



SNR^oo log SNR 

Assuming that we use exact IFR (see Section |II-B[ eq. Q), we have: 



i?comp(HfcCfc,Bfc,SNR) = log(SNR) - maxlog 



fH 



-l\Hi 



^kl\ 



(190) 



(191) 



This definitely shows that limsNR- 



i?,o„p(HfcC;.,B>o,SNR) 



^ gi^P — 1. Accordingly, R grows as log SNR. 

However, H^ must be full-rank in order to allow exact IFR. Since H^ = F^iVi for A; = 1,2 and 
Hfe = FfcsVa for k = 3,4, we need to show that Vi and V3 are full rank. For the alignment, we use 
the following construction method proposed in |26J: 

vi,£+i = (F^^ F12F22 F21) vi^i 



for 



(192) 

V2,^ = (F22'F2lFr/Fi2)'-^F22'F2lVi,i (193) 

1, . . . , M — 1. Once vi i is determined, other vectors are completely determined by the above 



equations. As argued in |26|, since the channel matrices are drawn form a continuous distribution then F = 

(Fj^j^^Fi2F22^F2i) has all distinct eigenvalues almost surely. From |47 Thm 1.3.9], F is diagonalizable 

such as 

' Ai 



E 



Aa/ 



E-i 



where the i-th column of E is an eigenvector of F associated with Aj. Choosing vi^i 
precoding matrix Vi can be rewritten as 

,Af-l 



El, the alignment 



Vi =E 



1 Ai 



1 Am 



A 



,M-1 



(194) 
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where J denotes the Vandermonde matrix. Therefore, the determinant of the Vi is computed by 

det(Vi) = det(E)det(J) (195) 

= det(E) II (A, -Ai)^0. (196) 

This shows that Vi is full rank . With the same procedure, we can show that V3 is full rank. 

Appendix C 
Proof of Theorem[7] 

We prove that PCoF with CIA and DPC can achieve the optimal 2M DoF. This scheme can be regarded 



as MIMO extension of Scaled PCoF and DPC proposed in Section [IirC] Consider the MIMO IC in (|72i. 
Let {w^ £ : ^ = 1, . . . , M} denote the independent messages to be intended for receiver k, ioi k = 1, 2. 
Without loss of generality, it is assume that transmitter 2 knows the W]^ © W2. Our achievable scheme 
proceeds as follows. 
Encoding: 

• Transmitter 1 independently produces the M lattice codewords ^21 = /{yv^n ®Yl.2e) ^'^^ ^ — 
1, . . . , M and transmits the channel input: 

X'2 = VX2 (197) 

where Xg = [T2 + ^2] mod A. 

• Transmitter 2 performs the DPC using the known interference signal F12X2 ^^ S^^'- 

Xi = [Ti - F-/F12X2 + Di] mod A (198) 

where T^ = /(— Wj^) denotes the lattice codewords corresponding to precoded messages — Wj^. 
Decoding: 

• Receiver 1 performs the modulo-lattice mapping as Y^ = [F^^^Y^ — D^] mod A that yields: 

Xi = [Ti - Ti + Xi + F^/Fi2X'2 - Di + F^-^Zi] mod A 

= [Ti + Xi - ([Ti - F1-/F12X2 + Di] mod A) + Fi^/Zi] mod A 
= [Ti + F5"/Zi] mod A. 

This shows that receiver 1 has interference-free channel, thus achieving the M DoF. 
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Receiver 2 applies the CoF receiver mapping (4 1 with integer coefficients B*"* = I I and scaling 
factor F^j^^, yielding 

= [Ti + T2 + ((F2-/F22 - Fr/Fi2)V - I)X2 + F2IIZ2] mod A 
= [Ti +T.2 + F2lZi2\ mod A 
= [/(W2) + F2-/Z2] mod A. 
where (a) is due to the fact that the precoding matrix is chosen as V = (F2x^F22 — F^]^^Fi2^ 



-1 



and (6) follows the precoding over Fg using the coefficient (—1) at the cognitive transmitter . Then, 
receiver 2 can achieve the M DoF. 
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